21 research outputs found

    Abstraction Raising in General-Purpose Compilers

    Get PDF

    PET-to-MLIR: A polyhedral front-end for MLIR

    Get PDF
    We present PET-to-MLIR, a new tool to enter the MLIR compiler framework from C source. The tool is based on the popular PET and ISL libraries for extracting and manipulating quasi-affine sets and relations, and on Loop Tactics, a declarative optimizer. The use of PET brings advanced diagnostics and full support for C by relying on the Clang parser. ISL allows easy manipulation of the polyhedral representation and efficient code generation. Loop Tactics, on the other hand, enables us to detect computational motifs transparently and lift the entry point into MLIR, thus enabling domain-specific optimizations in general-purpose code. We demonstrate our tool on the PolyBench/C benchmark suite and show that it successfully lowers most of the benchmarks to MLIR's affine dialect. We believe that our tool can benefit research in the compiler community by providing an automatic way to translate C code to the MLIR affine dialect.
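    The quasi-affine sets that PET extracts and ISL manipulates can be illustrated with a toy sketch. This is plain Python, not the actual PET/ISL API; it only shows what an iteration domain such as { [i, j] : 0 <= i < n and 0 <= j <= i } denotes and the visiting order a generated loop nest would follow.

    ```python
    # Toy sketch of a quasi-affine iteration domain in the spirit of the
    # sets ISL manipulates: { [i, j] : 0 <= i < n and 0 <= j <= i }.
    # Illustrative Python only, not the ISL/PET API.

    def triangular_domain(n):
        """Enumerate the integer points of the set above in lexicographic
        order, mirroring the order a generated loop nest visits them."""
        return [(i, j) for i in range(n) for j in range(i + 1)]

    # The equivalent loop nest ISL's code generator would emit in C:
    #   for (int i = 0; i < n; ++i)
    #     for (int j = 0; j <= i; ++j)
    #       S(i, j);

    print(triangular_domain(3))
    # [(0, 0), (1, 0), (1, 1), (2, 0), (2, 1), (2, 2)]
    ```

    Code generation for such a set amounts to emitting loop bounds that scan exactly these integer points, which is what makes the representation convenient for both analysis and lowering.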

    SEER: Super-Optimization Explorer for HLS using E-graph Rewriting with MLIR

    Full text link
    High-level synthesis (HLS) is a process that automatically translates a software program in a high-level language into a low-level hardware description. However, the hardware designs produced by HLS tools still suffer from a significant performance gap compared to manual implementations. This is because the input HLS programs must still be written using hardware design principles. Existing techniques either leave the program source unchanged or perform a fixed sequence of source transformation passes, potentially missing opportunities to find the optimal design. We propose a super-optimization approach for HLS that automatically rewrites an arbitrary software program into efficient HLS code that can be used to generate an optimized hardware design. We developed a toolflow named SEER, based on the e-graph data structure, to efficiently explore equivalent implementations of a program at scale. SEER provides an extensible framework, orchestrating existing software compiler passes and hardware synthesis optimizers. Our work is the first attempt to exploit e-graph rewriting for large software compiler frameworks, such as MLIR. Across a set of open-source benchmarks, we show that SEER achieves up to 38x the performance within 1.4x the area of the original program. Via an Intel-provided case study, SEER demonstrates the potential to outperform manually optimized designs produced by hardware experts.
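    The core idea of exploring equivalent implementations via rewriting can be sketched in a drastically simplified form. A real e-graph (as used by SEER and libraries such as egg) shares subterms and represents exponentially many equivalent programs compactly; this sketch merely computes the closure of one expression under a few rewrite rules, with rule names and string-level expressions chosen purely for illustration.

    ```python
    # Drastically simplified stand-in for e-graph exploration: breadth-first
    # closure of an expression under rewrite rules. A real e-graph shares
    # subterms and scales far better; rules here are illustrative only.
    from collections import deque

    # Each rule maps one expression form to an equivalent form.
    RULES = {
        "x * 2": "x << 1",   # strength reduction: multiply -> shift
        "x << 1": "x + x",   # shift expressed as addition
        "x + x": "x * 2",    # addition folded back to multiplication
    }

    def equivalents(expr, limit=10):
        """Collect all expressions reachable from `expr` via RULES."""
        seen, frontier = {expr}, deque([expr])
        while frontier and len(seen) < limit:
            nxt = RULES.get(frontier.popleft())
            if nxt and nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
        return seen

    print(sorted(equivalents("x * 2")))
    # ['x * 2', 'x + x', 'x << 1']
    ```

    A super-optimizer then picks, from the explored equivalence class, the variant that minimizes a cost model; for HLS that cost reflects latency and area rather than instruction count.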

    Transformations Déclaratives dans le ModÚle Polyédrique

    Get PDF
    Despite the availability of sophisticated automatic optimizers, performance-critical code sections are in practice still tuned by human experts. Pragma-based languages such as OpenMP or OpenACC are the standard interface for applying such transformations to large code bases, and loop transformation pragmas would be a straightforward extension providing fine-grained control over a compiler's loop optimizer. However, the manual optimization of programs via explicit sequences of directives is unlikely to fully solve this problem, as expressing complex optimization sequences explicitly results in difficult-to-read, non-performance-portable code. We address this problem by presenting a novel framework of composable program transformations based on the internal tree-like program representation of a polyhedral compiler. Based on a set of tree matchers and transformers, we describe an embedded transformation language which provides the foundation for the development of program optimization tactics. Using this language, we express core building blocks such as loop tiling, fusion, or data-layout transformations, and compose them into higher-level transformations expressing algorithm-specific optimization strategies for stencils, dense linear algebra, etc. We expect our approach to simplify the development of polyhedral optimizers and the integration of polyhedral and syntactic approaches.
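    Loop tiling, one of the core building blocks mentioned above, can be sketched as a schedule transformation. The framework in the work operates on ISL schedule trees; this hypothetical Python version only shows the essential legality argument: tiling regroups iterations into blocks without adding or dropping any iteration point.

    ```python
    # Minimal sketch of 2-D loop tiling as a schedule transformation
    # (illustrative Python; the actual framework rewrites ISL schedule trees).

    def original(n):
        """Row-major order: for i { for j { S(i, j); } }"""
        return [(i, j) for i in range(n) for j in range(n)]

    def tiled(n, t):
        """Tiled order: loops over t-by-t blocks, then points inside each."""
        pts = []
        for it in range(0, n, t):                     # tile loops
            for jt in range(0, n, t):
                for i in range(it, min(it + t, n)):   # point loops
                    for j in range(jt, min(jt + t, n)):
                        pts.append((i, j))
        return pts

    # Tiling visits the same set of points exactly once, in a new order:
    assert sorted(tiled(6, 4)) == original(6)   # same iteration domain
    assert tiled(6, 4) != original(6)           # but a different schedule
    ```

    Whether the reordering is legal for a given loop nest depends on its dependences, which is precisely what the polyhedral representation lets a compiler check; composing such blocks (tile, then fuse, then re-layout data) is what the tactics language described above expresses declaratively.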

    TDO-CIM: Transparent Detection and Offloading for Computation In-memory

    Get PDF
    Computation in-memory is a promising non-von Neumann approach aiming at eliminating data transfers to and from the memory subsystem. Although many architectures have been proposed, compiler support for such architectures is still lagging behind. In this paper, we close this gap by proposing an end-to-end compilation flow for in-memory computing based on the LLVM compiler infrastructure. Starting from sequential code, our approach automatically detects, optimizes, and offloads kernels suitable for in-memory acceleration. We demonstrate our compiler tool-flow on the PolyBench/C benchmark suite and evaluate the benefits of our proposed in-memory architecture, simulated in Gem5, by comparing it with a state-of-the-art von Neumann architecture. (Full version of a DATE 2020 publication.)

    Progressive Raising in Multi-level IR

    Get PDF
    Multi-level intermediate representations (IR) show great promise for lowering the design costs of domain-specific compilers by providing a reusable, extensible, and non-opinionated framework for expressing domain-specific and high-level abstractions directly in the IR. But while such frameworks support the progressive lowering of high-level representations to low-level IR, they do not raise in the opposite direction. Thus, the entry point into the compilation pipeline defines the highest level of abstraction for all subsequent transformations, limiting the set of applicable optimizations, in particular for general-purpose languages that are not semantically rich enough to model the required abstractions. We propose Progressive Raising, a complementary approach to the progressive lowering in multi-level IRs that raises from lower- to higher-level abstractions to leverage domain-specific transformations for low-level representations. We further introduce Multi-Level Tactics, our declarative approach for progressive raising, implemented on top of the MLIR framework, and demonstrate the progressive raising from affine loop nests specified in a general-purpose language to high-level linear algebra operations. Our raising paths leverage subsequent high-level domain-specific transformations with significant performance improvements.
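    What "raising from affine loops to linear algebra operations" buys can be illustrated with a toy example. Multi-Level Tactics performs this on MLIR affine IR; the sketch below just shows the two abstraction levels side by side and checks that the raised high-level operation computes the same result as the low-level loop nest it replaces (all names here are illustrative, not the tool's API).

    ```python
    # Toy illustration of raising: the same matrix multiplication at two
    # abstraction levels. Raising recognizes the loop nest on the left of
    # the pipeline and rewrites it as one high-level linear-algebra op.

    def matmul_loops(A, B):
        """The affine loop nest as written in a general-purpose language."""
        n, k, m = len(A), len(B), len(B[0])
        C = [[0] * m for _ in range(n)]
        for i in range(n):
            for j in range(m):
                for p in range(k):
                    C[i][j] += A[i][p] * B[p][j]
        return C

    def matmul_op(A, B):
        """The raised form: one 'matmul' operation, the target abstraction
        on which domain-specific transformations become applicable."""
        return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
                for row in A]

    A = [[1, 2], [3, 4]]
    B = [[5, 6], [7, 8]]
    assert matmul_loops(A, B) == matmul_op(A, B) == [[19, 22], [43, 50]]
    ```

    Once the computation is expressed as a single high-level operation, transformations that are impractical at the loop level, such as swapping in a tuned library call or an accelerator mapping, become simple rewrites.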

    TC-CIM: Empowering Tensor Comprehensions for Computing-In-Memory

    Get PDF
    Memristor-based, non-von-Neumann architectures performing tensor operations directly in memory are a promising approach to address the ever-increasing demand for energy-efficient, high-throughput hardware accelerators for Machine Learning (ML) inference. A major challenge for the programmability and exploitation of such Computing-In-Memory (CIM) architectures consists in the efficient mapping of tensor operations from high-level ML frameworks to fixed-function hardware blocks implementing in-memory computations. We demonstrate the programmability of memristor-based accelerators with TC-CIM, a fully automatic, end-to-end compilation flow from Tensor Comprehensions, a mathematical notation for tensor operations, to fixed-function memristor-based hardware blocks. Operations suitable for acceleration are identified using Loop Tactics, a declarative framework to describe computational patterns in a polyhedral representation. We evaluate our compilation flow on a system-level simulator based on Gem5, incorporating crossbar arrays of memristive devices. Our results show that TC-CIM reliably recognizes tensor operations commonly used in ML workloads across multiple benchmarks and offloads these operations to the accelerator.
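    The offloading decision at the end of such a flow reduces to dispatching recognized operations to the fixed-function blocks and leaving everything else on the CPU. A hypothetical sketch, with the set of supported patterns and all names invented for illustration:

    ```python
    # Hypothetical sketch of the offload dispatch in a TC-CIM-style flow:
    # operations recognized by pattern matching go to the accelerator,
    # everything else takes the CPU fallback path. Names are illustrative.

    CIM_SUPPORTED = {"matvec", "matmul"}   # patterns the crossbar executes

    def dispatch(op_name):
        """Return the execution target chosen for a recognized operation."""
        return "cim-accelerator" if op_name in CIM_SUPPORTED else "cpu-fallback"

    for op in ("matmul", "softmax"):
        print(op, "->", dispatch(op))
    # matmul -> cim-accelerator
    # softmax -> cpu-fallback
    ```

    The hard part, which the abstract attributes to Loop Tactics, is producing the `op_name` classification in the first place, i.e. proving that a given loop nest really is a matvec or matmul in the polyhedral sense.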

    Mixed-species aggregations in arthropods

    Get PDF
    This review offers the first synthesis of the research on mixed-species groupings of arthropods and highlights the behavioural and evolutionary questions raised by such behaviour. Mixed-species groups are commonly found in mammals and birds. Such groups are also observed in a large range of arthropod taxa independent of their level of sociality. Several examples are presented to highlight the mechanisms underlying such groupings, particularly the evidence for phylogenetic proximity between members that promotes cross-species recognition. The advantages offered by such aggregates are described and discussed. These advantages can be attributed to the increase in group size and could be identical to those of non-mixed groupings, but competition-cooperation dynamics might also be involved, and such effects may differ between homo- and heterospecific groups. We discuss three extreme cases of interspecific recognition that are likely involved in mixed-species groups as vectors for cross-species aggregation: tolerance behaviour between two social species, a one-way mechanism in which one species is attractive to others, and a two-way mechanism of mutual attraction. As shown in this review, the study of mixed-species groups offers biologists an interesting way to explore the frontiers of cooperation-competition, including the process of sympatric speciation.

    Declarative Loop Tactics for Domain-specific Optimization

    No full text
    Increasingly complex hardware makes the design of effective compilers difficult. To reduce this problem, we introduce Declarative Loop Tactics, which is a novel framework of composable program transformations based on an internal tree-like program representation of a polyhedral compiler. The framework is based on a declarative C++ API built around easy-to-program matchers and builders, which provide the foundation to develop loop optimization strategies. Using our matchers and builders, we express computational patterns and core building blocks, such as loop tiling, fusion, and data-layout transformations, and compose them into algorithm-specific optimizations. Declarative Loop Tactics (Loop Tactics for short) can be applied to many domains. For two of them, stencils and linear algebra, we show how developers can express sophisticated domain-specific optimizations as a set of composable transformations or calls to optimized libraries. By allowing developers to add highly customized optimizations for a given computational pattern, we expect our approach to reduce the need for DSLs and to extend the range of optimizations that can be performed by a current general-purpose compiler.
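    The matcher/builder split described above can be sketched in miniature. The actual Loop Tactics API is declarative C++ over ISL schedule trees; this toy Python version only conveys the division of labour: a matcher recognizes a structural pattern in the loop tree, and a builder constructs the transformed tree, here a simple loop interchange.

    ```python
    # Toy matcher/builder sketch (illustrative Python, not the C++ API):
    # a matcher recognizes a tree shape, a builder rebuilds the new tree.

    class Loop:
        """One loop level; `body` is a nested Loop or a statement string."""
        def __init__(self, var, body):
            self.var, self.body = var, body

    def match_perfect_nest(node, depth):
        """Matcher: does `node` start a perfectly nested band of
        `depth` loops (each loop's body is exactly the next loop)?"""
        for _ in range(depth):
            if not isinstance(node, Loop):
                return False
            node = node.body
        return True

    def build_interchanged(node):
        """Builder: swap the two outermost loops of a matched 2-deep nest."""
        inner = node.body
        return Loop(inner.var, Loop(node.var, inner.body))

    nest = Loop("i", Loop("j", "S(i, j)"))
    if match_perfect_nest(nest, 2):      # compose: match, then rebuild
        nest = build_interchanged(nest)
    print(nest.var, nest.body.var)       # j i
    ```

    Composing several such match/build pairs (tile, then interchange, then fuse) is how the framework assembles algorithm-specific strategies from small, reusable pieces.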
